Yixin Cao

TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories

May 29, 2026

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents

May 28, 2026

ATLAS: All-round Testing of Long-context Abilities across Scales

May 27, 2026

Skill-as-Pseudocode: Refactoring Skill Libraries to Pseudocode for LLM Agents

May 27, 2026

DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes

May 27, 2026

Bridging the Detection-to-Abstention Gap in Reasoning Models under Insufficient Information

May 27, 2026

Beyond Literal Translation: Evaluating Cultural Effectiveness in Social Media UGC

May 25, 2026

SliceGraph: Mapping Process Isomers in Multi-Run Chain-of-Thought Reasoning

May 14, 2026

Reinforcement Learning with Conditional Expectation Reward

Mar 11, 2026

GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

Mar 02, 2026